Dataset statistics
| Number of variables | 42 |
|---|---|
| Number of observations | 754472 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 679.6 MiB |
| Average record size in memory | 944.5 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 32 |
df_index is highly correlated with building_id and 3 other fields | High correlation |
building_id is highly correlated with df_index and 3 other fields | High correlation |
district_id is highly correlated with df_index and 3 other fields | High correlation |
vdcmun_id is highly correlated with df_index and 3 other fields | High correlation |
ward_id is highly correlated with df_index and 3 other fields | High correlation |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
building_id has unique values | Unique |
count_families has 70842 (9.4%) zeros | Zeros |
Reproduction
| Analysis started | 2021-08-09 15:09:59.368134 |
|---|---|
| Analysis finished | 2021-08-09 15:15:33.869906 |
| Duration | 5 minutes and 34.5 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 754472 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 381069.7559 |
|---|---|
| Minimum | 0 |
| Maximum | 762105 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 38105.55 |
| Q1 | 190567.75 |
| median | 381073.5 |
| Q3 | 571576.25 |
| 95-th percentile | 724013.45 |
| Maximum | 762105 |
| Range | 762105 |
| Interquartile range (IQR) | 381008.5 |
Descriptive statistics
| Standard deviation | 219996.1296 |
|---|---|
| Coefficient of variation (CV) | 0.5773119651 |
| Kurtosis | -1.199900889 |
| Mean | 381069.7559 |
| Median Absolute Deviation (MAD) | 190504.5 |
| Skewness | -4.62529378 × 105 |
| Sum | 2.875064609 × 1011 |
| Variance | 4.839829703 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 115565 | 1 | < 0.1% |
| 101220 | 1 | < 0.1% |
| 99173 | 1 | < 0.1% |
| 105318 | 1 | < 0.1% |
| 103271 | 1 | < 0.1% |
| 125800 | 1 | < 0.1% |
| 123753 | 1 | < 0.1% |
| 129898 | 1 | < 0.1% |
| 127851 | 1 | < 0.1% |
| Other values (754462) | 754462 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 762105 | 1 | |
| 762104 | 1 | |
| 762103 | 1 | |
| 762102 | 1 | |
| 762101 | 1 | |
| 762100 | 1 | |
| 762099 | 1 | |
| 762098 | 1 | |
| 762097 | 1 | |
| 762096 | 1 |
| Distinct | 754472 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.607553746 × 1011 |
|---|---|
| Minimum | 1.20101 × 1011 |
| Maximum | 3.667090013 × 1011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 1.20101 × 1011 |
|---|---|
| 5-th percentile | 1.255030005 × 1011 |
| Q1 | 2.219090006 × 1011 |
| median | 2.463020003 × 1011 |
| Q3 | 3.036080011 × 1011 |
| 95-th percentile | 3.63801001 × 1011 |
| Maximum | 3.667090013 × 1011 |
| Range | 2.466080013 × 1011 |
| Interquartile range (IQR) | 8.169900042 × 1010 |
Descriptive statistics
| Standard deviation | 5.801936234 × 1010 |
|---|---|
| Coefficient of variation (CV) | 0.2225049529 |
| Kurtosis | -0.08249809674 |
| Mean | 2.607553746 × 1011 |
| Median Absolute Deviation (MAD) | 3.980000151 × 1010 |
| Skewness | -0.1613184247 |
| Sum | 1.96732629 × 1017 |
| Variance | 3.366246407 × 1021 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.036010005 × 1011 | 1 | < 0.1% |
| 2.804050128 × 1011 | 1 | < 0.1% |
| 2.048040002 × 1011 | 1 | < 0.1% |
| 2.143080001 × 1011 | 1 | < 0.1% |
| 2.423070114 × 1011 | 1 | < 0.1% |
| 3.108070018 × 1011 | 1 | < 0.1% |
| 3.113090003 × 1011 | 1 | < 0.1% |
| 2.203080012 × 1011 | 1 | < 0.1% |
| 3.642030001 × 1011 | 1 | < 0.1% |
| 2.40401001 × 1011 | 1 | < 0.1% |
| Other values (754462) | 754462 |
| Value | Count | Frequency (%) |
| 1.20101 × 1011 | 1 | |
| 1.20101 × 1011 | 1 | |
| 1.20101 × 1011 | 1 | |
| 1.20101 × 1011 | 1 | |
| 1.201010001 × 1011 | 1 | |
| 1.201010001 × 1011 | 1 | |
| 1.201010001 × 1011 | 1 | |
| 1.201010001 × 1011 | 1 | |
| 1.201010001 × 1011 | 1 | |
| 1.201010001 × 1011 | 1 |
| Value | Count | Frequency (%) |
| 3.667090013 × 1011 | 1 | |
| 3.667090013 × 1011 | 1 | |
| 3.667090013 × 1011 | 1 | |
| 3.667090013 × 1011 | 1 | |
| 3.667090012 × 1011 | 1 | |
| 3.667090012 × 1011 | 1 | |
| 3.667090012 × 1011 | 1 | |
| 3.667090012 × 1011 | 1 | |
| 3.667090012 × 1011 | 1 | |
| 3.667090012 × 1011 | 1 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.76821141 |
|---|---|
| Minimum | 12 |
| Maximum | 36 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 22 |
| median | 24 |
| Q3 | 30 |
| 95-th percentile | 36 |
| Maximum | 36 |
| Range | 24 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 5.807650562 |
|---|---|
| Coefficient of variation (CV) | 0.2253804298 |
| Kurtosis | -0.1225071928 |
| Mean | 25.76821141 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.1553353218 |
| Sum | 19441394 |
| Variance | 33.72880505 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 97071 | |
| 31 | 90076 | |
| 30 | 88219 | |
| 23 | 87842 | |
| 36 | 77317 | |
| 28 | 76369 | |
| 20 | 68035 | |
| 22 | 60050 | |
| 21 | 58026 | |
| 12 | 38955 |
| Value | Count | Frequency (%) |
| 12 | 38955 | |
| 20 | 68035 | |
| 21 | 58026 | |
| 22 | 60050 | |
| 23 | 87842 | |
| 24 | 97071 | |
| 28 | 76369 | |
| 29 | 12512 | 1.7% |
| 30 | 88219 | |
| 31 | 90076 |
| Value | Count | Frequency (%) |
| 36 | 77317 | |
| 31 | 90076 | |
| 30 | 88219 | |
| 29 | 12512 | 1.7% |
| 28 | 76369 | |
| 24 | 97071 | |
| 23 | 87842 | |
| 22 | 60050 | |
| 21 | 58026 | |
| 20 | 68035 |
| Distinct | 110 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2582.726213 |
|---|---|
| Minimum | 1201 |
| Maximum | 3611 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 1201 |
|---|---|
| 5-th percentile | 1208 |
| Q1 | 2204 |
| median | 2410 |
| Q3 | 3010 |
| 95-th percentile | 3608 |
| Maximum | 3611 |
| Range | 2410 |
| Interquartile range (IQR) | 806 |
Descriptive statistics
| Standard deviation | 581.1821751 |
|---|---|
| Coefficient of variation (CV) | 0.2250266297 |
| Kurtosis | -0.1218437698 |
| Mean | 2582.726213 |
| Median Absolute Deviation (MAD) | 401 |
| Skewness | -0.1563562265 |
| Sum | 1948594611 |
| Variance | 337772.7207 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3104 | 32362 | 4.3% |
| 2005 | 15486 | 2.1% |
| 3009 | 15076 | 2.0% |
| 2802 | 15045 | 2.0% |
| 2001 | 14862 | 2.0% |
| 2304 | 14431 | 1.9% |
| 2310 | 13831 | 1.8% |
| 2105 | 13389 | 1.8% |
| 3608 | 12969 | 1.7% |
| 2410 | 12145 | 1.6% |
| Other values (100) | 594876 |
| Value | Count | Frequency (%) |
| 1201 | 4815 | 0.6% |
| 1202 | 3674 | 0.5% |
| 1203 | 3916 | 0.5% |
| 1204 | 3848 | 0.5% |
| 1205 | 5215 | 0.7% |
| 1206 | 4657 | 0.6% |
| 1207 | 7703 | |
| 1208 | 5127 | 0.7% |
| 2001 | 14862 | |
| 2002 | 2991 | 0.4% |
| Value | Count | Frequency (%) |
| 3611 | 7131 | |
| 3610 | 7821 | |
| 3609 | 11616 | |
| 3608 | 12969 | |
| 3607 | 6219 | |
| 3606 | 3648 | 0.5% |
| 3605 | 2724 | 0.4% |
| 3604 | 6691 | |
| 3603 | 7245 | |
| 3602 | 4128 | 0.5% |
| Distinct | 945 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 258278.0626 |
|---|---|
| Minimum | 120101 |
| Maximum | 361108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 120101 |
|---|---|
| 5-th percentile | 120808 |
| Q1 | 220402 |
| median | 241004 |
| Q3 | 301006 |
| 95-th percentile | 360803 |
| Maximum | 361108 |
| Range | 241007 |
| Interquartile range (IQR) | 80604 |
Descriptive statistics
| Standard deviation | 58118.28875 |
|---|---|
| Coefficient of variation (CV) | 0.2250221647 |
| Kurtosis | -0.1218507637 |
| Mean | 258278.0626 |
| Median Absolute Deviation (MAD) | 40101 |
| Skewness | -0.1563602283 |
| Sum | 1.948635665 × 1011 |
| Variance | 3377735487 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 310405 | 2559 | 0.3% |
| 310404 | 2408 | 0.3% |
| 310412 | 2317 | 0.3% |
| 200506 | 2026 | 0.3% |
| 310411 | 1982 | 0.3% |
| 310419 | 1957 | 0.3% |
| 310415 | 1913 | 0.3% |
| 241004 | 1864 | 0.2% |
| 310409 | 1840 | 0.2% |
| 300903 | 1822 | 0.2% |
| Other values (935) | 733784 |
| Value | Count | Frequency (%) |
| 120101 | 598 | |
| 120102 | 564 | |
| 120103 | 435 | |
| 120104 | 558 | |
| 120105 | 437 | |
| 120106 | 638 | |
| 120107 | 420 | |
| 120108 | 301 | |
| 120109 | 288 | |
| 120110 | 576 |
| Value | Count | Frequency (%) |
| 361108 | 911 | |
| 361107 | 931 | |
| 361106 | 927 | |
| 361105 | 904 | |
| 361104 | 919 | |
| 361103 | 1096 | |
| 361102 | 817 | |
| 361101 | 626 | |
| 361009 | 994 | |
| 361008 | 825 |
legal_ownership_status
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 46.1 MiB |
| Private | |
|---|---|
| Public | 19010 |
| Institutional | 7744 |
| Other | 3621 |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.026789596 |
| Min length | 5 |
Characters and Unicode
| Total characters | 5301516 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Private |
|---|---|
| 2nd row | Private |
| 3rd row | Private |
| 4th row | Private |
| 5th row | Private |
| Value | Count | Frequency (%) |
| Private | 724097 | |
| Public | 19010 | 2.5% |
| Institutional | 7744 | 1.0% |
| Other | 3621 | 0.5% |
| Value | Count | Frequency (%) |
| private | 724097 | |
| public | 19010 | 2.5% |
| institutional | 7744 | 1.0% |
| other | 3621 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 758595 | |
| t | 750950 | |
| P | 743107 | |
| a | 731841 | |
| r | 727718 | |
| e | 727718 | |
| v | 724097 | |
| u | 26754 | 0.5% |
| l | 26754 | 0.5% |
| b | 19010 | 0.4% |
| Other values (7) | 64972 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4547044 | |
| Uppercase Letter | 754472 | 14.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 758595 | |
| t | 750950 | |
| a | 731841 | |
| r | 727718 | |
| e | 727718 | |
| v | 724097 | |
| u | 26754 | 0.6% |
| l | 26754 | 0.6% |
| b | 19010 | 0.4% |
| c | 19010 | 0.4% |
| Other values (4) | 34597 | 0.8% |
| Value | Count | Frequency (%) |
| P | 743107 | |
| I | 7744 | 1.0% |
| O | 3621 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5301516 |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 758595 | |
| t | 750950 | |
| P | 743107 | |
| a | 731841 | |
| r | 727718 | |
| e | 727718 | |
| v | 724097 | |
| u | 26754 | 0.5% |
| l | 26754 | 0.5% |
| b | 19010 | 0.4% |
| Other values (7) | 64972 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5301516 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 758595 | |
| t | 750950 | |
| P | 743107 | |
| a | 731841 | |
| r | 727718 | |
| e | 727718 | |
| v | 724097 | |
| u | 26754 | 0.5% |
| l | 26754 | 0.5% |
| b | 19010 | 0.4% |
| Other values (7) | 64972 | 1.2% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9806487186 |
|---|---|
| Minimum | 0 |
| Maximum | 11 |
| Zeros | 70842 |
| Zeros (%) | 9.4% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4501782998 |
|---|---|
| Coefficient of variation (CV) | 0.4590617325 |
| Kurtosis | 15.08826622 |
| Mean | 0.9806487186 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.496671244 |
| Sum | 739872 |
| Variance | 0.2026605016 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 637014 | |
| 0 | 70842 | 9.4% |
| 2 | 39343 | 5.2% |
| 3 | 5615 | 0.7% |
| 4 | 1204 | 0.2% |
| 5 | 299 | < 0.1% |
| 6 | 104 | < 0.1% |
| 7 | 27 | < 0.1% |
| 8 | 15 | < 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 70842 | 9.4% |
| 1 | 637014 | |
| 2 | 39343 | 5.2% |
| 3 | 5615 | 0.7% |
| 4 | 1204 | 0.2% |
| 5 | 299 | < 0.1% |
| 6 | 104 | < 0.1% |
| 7 | 27 | < 0.1% |
| 8 | 15 | < 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 15 | < 0.1% |
| 7 | 27 | < 0.1% |
| 6 | 104 | < 0.1% |
| 5 | 299 | < 0.1% |
| 4 | 1204 | 0.2% |
| 3 | 5615 | 0.7% |
| 2 | 39343 | 5.2% |
| 1 | 637014 |
has_secondary_use
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 43.2 MiB |
| 0.0 | |
|---|---|
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2263416 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 663061 | |
| 1.0 | 91411 | 12.1% |
| Value | Count | Frequency (%) |
| 0.0 | 663061 | |
| 1.0 | 91411 | 12.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1417533 | |
| . | 754472 | |
| 1 | 91411 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1508944 | |
| Other Punctuation | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 1417533 | |
| 1 | 91411 | 6.1% |
| Value | Count | Frequency (%) |
| . | 754472 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2263416 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 1417533 | |
| . | 754472 | |
| 1 | 91411 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2263416 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 1417533 | |
| . | 754472 | |
| 1 | 91411 | 4.0% |
has_secondary_use_agriculture
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 54206 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 700266 | |
| 1 | 54206 | 7.2% |
| Value | Count | Frequency (%) |
| 0 | 700266 | |
| 1 | 54206 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 700266 | |
| 1 | 54206 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 700266 | |
| 1 | 54206 | 7.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 700266 | |
| 1 | 54206 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 700266 | |
| 1 | 54206 | 7.2% |
has_secondary_use_hotel
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 26458 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 728014 | |
| 1 | 26458 | 3.5% |
| Value | Count | Frequency (%) |
| 0 | 728014 | |
| 1 | 26458 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 728014 | |
| 1 | 26458 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 728014 | |
| 1 | 26458 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 728014 | |
| 1 | 26458 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 728014 | |
| 1 | 26458 | 3.5% |
has_secondary_use_rental
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 6236 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 748236 | |
| 1 | 6236 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 748236 | |
| 1 | 6236 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 748236 | |
| 1 | 6236 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 748236 | |
| 1 | 6236 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 748236 | |
| 1 | 6236 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 748236 | |
| 1 | 6236 | 0.8% |
has_secondary_use_institution
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 876 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 753596 | |
| 1 | 876 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 753596 | |
| 1 | 876 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 753596 | |
| 1 | 876 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 753596 | |
| 1 | 876 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 753596 | |
| 1 | 876 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 753596 | |
| 1 | 876 | 0.1% |
has_secondary_use_school
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 318 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 754154 | |
| 1 | 318 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 754154 | |
| 1 | 318 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 754154 | |
| 1 | 318 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 754154 | |
| 1 | 318 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 754154 | |
| 1 | 318 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 754154 | |
| 1 | 318 | < 0.1% |
has_secondary_use_industry
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 883 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 753589 | |
| 1 | 883 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 753589 | |
| 1 | 883 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 753589 | |
| 1 | 883 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 753589 | |
| 1 | 883 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 753589 | |
| 1 | 883 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 753589 | |
| 1 | 883 | 0.1% |
has_secondary_use_health_post
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 171 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 754301 | |
| 1 | 171 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 754301 | |
| 1 | 171 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 754301 | |
| 1 | 171 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 754301 | |
| 1 | 171 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 754301 | |
| 1 | 171 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 754301 | |
| 1 | 171 | < 0.1% |
has_secondary_use_gov_office
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 140 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 754332 | |
| 1 | 140 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 754332 | |
| 1 | 140 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 754332 | |
| 1 | 140 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 754332 | |
| 1 | 140 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 754332 | |
| 1 | 140 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 754332 | |
| 1 | 140 | < 0.1% |
has_secondary_use_use_police
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 74 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 754398 | |
| 1 | 74 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 754398 | |
| 1 | 74 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 754398 | |
| 1 | 74 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 754398 | |
| 1 | 74 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 754398 | |
| 1 | 74 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 754398 | |
| 1 | 74 | < 0.1% |
has_secondary_use_other
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 3387 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 751085 | |
| 1 | 3387 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 751085 | |
| 1 | 3387 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 751085 | |
| 1 | 3387 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 751085 | |
| 1 | 3387 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 751085 | |
| 1 | 3387 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 751085 | |
| 1 | 3387 | 0.4% |
count_floors_pre_eq
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.087832285 |
|---|---|
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6551028871 |
|---|---|
| Coefficient of variation (CV) | 0.3137717966 |
| Kurtosis | 1.607943784 |
| Mean | 2.087832285 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.4261028197 |
| Sum | 1575211 |
| Variance | 0.4291597927 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 463348 | |
| 3 | 165377 | 21.9% |
| 1 | 117726 | 15.6% |
| 4 | 6030 | 0.8% |
| 5 | 1553 | 0.2% |
| 6 | 329 | < 0.1% |
| 7 | 85 | < 0.1% |
| 8 | 12 | < 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 117726 | 15.6% |
| 2 | 463348 | |
| 3 | 165377 | 21.9% |
| 4 | 6030 | 0.8% |
| 5 | 1553 | 0.2% |
| 6 | 329 | < 0.1% |
| 7 | 85 | < 0.1% |
| 8 | 12 | < 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 12 | < 0.1% |
| 8 | 12 | < 0.1% |
| 7 | 85 | < 0.1% |
| 6 | 329 | < 0.1% |
| 5 | 1553 | 0.2% |
| 4 | 6030 | 0.8% |
| 3 | 165377 | 21.9% |
| 2 | 463348 | |
| 1 | 117726 | 15.6% |
age_building
Real number (ℝ≥0)
| Distinct | 176 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.33110175 |
|---|---|
| Minimum | 0 |
| Maximum | 999 |
| Zeros | 4693 |
| Zeros (%) | 0.6% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 16 |
| Q3 | 27 |
| 95-th percentile | 54 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 65.06845647 |
|---|---|
| Coefficient of variation (CV) | 2.674291413 |
| Kurtosis | 204.9754775 |
| Mean | 24.33110175 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 13.90794554 |
| Sum | 18357135 |
| Variance | 4233.904027 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 49473 | 6.6% |
| 20 | 46042 | 6.1% |
| 10 | 39369 | 5.2% |
| 25 | 36681 | 4.9% |
| 12 | 36087 | 4.8% |
| 30 | 30546 | 4.0% |
| 5 | 28875 | 3.8% |
| 3 | 24202 | 3.2% |
| 4 | 23184 | 3.1% |
| 7 | 23151 | 3.1% |
| Other values (166) | 416862 |
| Value | Count | Frequency (%) |
| 0 | 4693 | 0.6% |
| 1 | 19174 | |
| 2 | 21399 | |
| 3 | 24202 | |
| 4 | 23184 | |
| 5 | 28875 | |
| 6 | 19937 | |
| 7 | 23151 | |
| 8 | 22630 | |
| 9 | 14995 |
| Value | Count | Frequency (%) |
| 999 | 3116 | |
| 200 | 259 | < 0.1% |
| 199 | 1 | < 0.1% |
| 196 | 2 | < 0.1% |
| 195 | 2 | < 0.1% |
| 193 | 1 | < 0.1% |
| 190 | 7 | < 0.1% |
| 187 | 1 | < 0.1% |
| 185 | 1 | < 0.1% |
| 180 | 20 | < 0.1% |
plinth_area_sq_ft
Real number (ℝ≥0)
| Distinct | 2123 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 406.6594956 |
|---|---|
| Minimum | 70 |
| Maximum | 5000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 70 |
|---|---|
| 5-th percentile | 176 |
| Q1 | 280 |
| median | 358 |
| Q3 | 470 |
| 95-th percentile | 800 |
| Maximum | 5000 |
| Range | 4930 |
| Interquartile range (IQR) | 190 |
Descriptive statistics
| Standard deviation | 226.7757616 |
|---|---|
| Coefficient of variation (CV) | 0.557655149 |
| Kurtosis | 34.2743952 |
| Mean | 406.6594956 |
| Median Absolute Deviation (MAD) | 92 |
| Skewness | 3.8388692 |
| Sum | 306813203 |
| Variance | 51427.24607 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 300 | 27382 | 3.6% |
| 450 | 21086 | 2.8% |
| 400 | 19744 | 2.6% |
| 350 | 18985 | 2.5% |
| 360 | 14825 | 2.0% |
| 250 | 13848 | 1.8% |
| 280 | 13771 | 1.8% |
| 200 | 12326 | 1.6% |
| 320 | 11739 | 1.6% |
| 420 | 10827 | 1.4% |
| Other values (2113) | 589939 |
| Value | Count | Frequency (%) |
| 70 | 111 | |
| 71 | 4 | < 0.1% |
| 72 | 56 | |
| 73 | 15 | < 0.1% |
| 74 | 4 | < 0.1% |
| 75 | 101 | |
| 76 | 2 | < 0.1% |
| 77 | 26 | < 0.1% |
| 78 | 37 | < 0.1% |
| 79 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 5000 | 6 | |
| 4995 | 1 | < 0.1% |
| 4928 | 1 | < 0.1% |
| 4901 | 1 | < 0.1% |
| 4890 | 1 | < 0.1% |
| 4800 | 1 | < 0.1% |
| 4795 | 1 | < 0.1% |
| 4738 | 1 | < 0.1% |
| 4701 | 1 | < 0.1% |
| 4652 | 1 | < 0.1% |
height_ft_pre_eq
Real number (ℝ≥0)
| Distinct | 79 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.04916021 |
|---|---|
| Minimum | 6 |
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 5.8 MiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 12 |
| median | 16 |
| Q3 | 18 |
| 95-th percentile | 24 |
| Maximum | 99 |
| Range | 93 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.493172547 |
|---|---|
| Coefficient of variation (CV) | 0.3422716501 |
| Kurtosis | 25.72176514 |
| Mean | 16.04916021 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.493579751 |
| Sum | 12108642 |
| Variance | 30.17494463 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 101056 | |
| 14 | 94042 | |
| 12 | 80975 | |
| 16 | 73701 | |
| 15 | 59447 | 7.9% |
| 20 | 49607 | 6.6% |
| 21 | 37396 | 5.0% |
| 10 | 29831 | 4.0% |
| 17 | 28309 | 3.8% |
| 13 | 24903 | 3.3% |
| Other values (69) | 175205 |
| Value | Count | Frequency (%) |
| 6 | 9471 | 1.3% |
| 7 | 17276 | 2.3% |
| 8 | 21192 | 2.8% |
| 9 | 21672 | 2.9% |
| 10 | 29831 | 4.0% |
| 11 | 11081 | 1.5% |
| 12 | 80975 | |
| 13 | 24903 | 3.3% |
| 14 | 94042 | |
| 15 | 59447 |
| Value | Count | Frequency (%) |
| 99 | 300 | |
| 97 | 1 | < 0.1% |
| 96 | 2 | < 0.1% |
| 95 | 2 | < 0.1% |
| 93 | 1 | < 0.1% |
| 90 | 3 | < 0.1% |
| 89 | 1 | < 0.1% |
| 85 | 3 | < 0.1% |
| 81 | 1 | < 0.1% |
| 80 | 3 | < 0.1% |
land_surface_condition
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.1 MiB |
| Flat | |
|---|---|
| Moderate slope | |
| Steep slope | 24554 |
Length
| Max length | 14 |
|---|---|
| Median length | 4 |
| Mean length | 5.613417065 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4235166 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Flat |
|---|---|
| 2nd row | Moderate slope |
| 3rd row | Flat |
| 4th row | Flat |
| 5th row | Flat |
| Value | Count | Frequency (%) |
| Flat | 625378 | |
| Moderate slope | 104540 | 13.9% |
| Steep slope | 24554 | 3.3% |
| Value | Count | Frequency (%) |
| flat | 625378 | |
| slope | 129094 | 14.6% |
| moderate | 104540 | 11.8% |
| steep | 24554 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 754472 | |
| t | 754472 | |
| a | 729918 | |
| F | 625378 | |
| e | 387282 | |
| o | 233634 | 5.5% |
| p | 153648 | 3.6% |
| 129094 | 3.0% | |
| s | 129094 | 3.0% |
| M | 104540 | 2.5% |
| Other values (3) | 233634 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3351600 | |
| Uppercase Letter | 754472 | 17.8% |
| Space Separator | 129094 | 3.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| l | 754472 | |
| t | 754472 | |
| a | 729918 | |
| e | 387282 | |
| o | 233634 | 7.0% |
| p | 153648 | 4.6% |
| s | 129094 | 3.9% |
| d | 104540 | 3.1% |
| r | 104540 | 3.1% |
| Value | Count | Frequency (%) |
| F | 625378 | |
| M | 104540 | 13.9% |
| S | 24554 | 3.3% |
| Value | Count | Frequency (%) |
| 129094 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4106072 | |
| Common | 129094 | 3.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| l | 754472 | |
| t | 754472 | |
| a | 729918 | |
| F | 625378 | |
| e | 387282 | |
| o | 233634 | 5.7% |
| p | 153648 | 3.7% |
| s | 129094 | 3.1% |
| M | 104540 | 2.5% |
| d | 104540 | 2.5% |
| Other values (2) | 129094 | 3.1% |
| Value | Count | Frequency (%) |
| 129094 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4235166 |
Most frequent character per block
| Value | Count | Frequency (%) |
| l | 754472 | |
| t | 754472 | |
| a | 729918 | |
| F | 625378 | |
| e | 387282 | |
| o | 233634 | 5.5% |
| p | 153648 | 3.6% |
| 129094 | 3.0% | |
| s | 129094 | 3.0% |
| M | 104540 | 2.5% |
| Other values (3) | 233634 | 5.5% |
foundation_type
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.5 MiB |
| Mud mortar-Stone/Brick | |
|---|---|
| Bamboo/Timber | 56860 |
| Cement-Stone/Brick | 38843 |
| RC | 31819 |
| Other | 4518 |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 20.1705113 |
| Min length | 2 |
Characters and Unicode
| Total characters | 15218086 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RC |
|---|---|
| 2nd row | Mud mortar-Stone/Brick |
| 3rd row | Mud mortar-Stone/Brick |
| 4th row | Mud mortar-Stone/Brick |
| 5th row | Mud mortar-Stone/Brick |
| Value | Count | Frequency (%) |
| Mud mortar-Stone/Brick | 622432 | |
| Bamboo/Timber | 56860 | 7.5% |
| Cement-Stone/Brick | 38843 | 5.1% |
| RC | 31819 | 4.2% |
| Other | 4518 | 0.6% |
| Value | Count | Frequency (%) |
| mud | 622432 | |
| mortar-stone/brick | 622432 | |
| bamboo/timber | 56860 | 4.1% |
| cement-stone/brick | 38843 | 2.8% |
| rc | 31819 | 2.3% |
| other | 4518 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 1967517 | 12.9% |
| o | 1397427 | 9.2% |
| t | 1327068 | 8.7% |
| e | 800339 | 5.3% |
| m | 774995 | 5.1% |
| / | 718135 | 4.7% |
| B | 718135 | 4.7% |
| i | 718135 | 4.7% |
| n | 700118 | 4.6% |
| a | 679292 | 4.5% |
| Other values (14) | 5416925 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11050543 | |
| Uppercase Letter | 2165701 | 14.2% |
| Other Punctuation | 718135 | 4.7% |
| Dash Punctuation | 661275 | 4.3% |
| Space Separator | 622432 | 4.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| r | 1967517 | |
| o | 1397427 | |
| t | 1327068 | |
| e | 800339 | |
| m | 774995 | 7.0% |
| i | 718135 | 6.5% |
| n | 700118 | 6.3% |
| a | 679292 | 6.1% |
| c | 661275 | 6.0% |
| k | 661275 | 6.0% |
| Other values (4) | 1363102 |
| Value | Count | Frequency (%) |
| B | 718135 | |
| S | 661275 | |
| M | 622432 | |
| C | 70662 | 3.3% |
| T | 56860 | 2.6% |
| R | 31819 | 1.5% |
| O | 4518 | 0.2% |
| Value | Count | Frequency (%) |
| 622432 |
| Value | Count | Frequency (%) |
| - | 661275 |
| Value | Count | Frequency (%) |
| / | 718135 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13216244 | |
| Common | 2001842 | 13.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| r | 1967517 | |
| o | 1397427 | |
| t | 1327068 | 10.0% |
| e | 800339 | 6.1% |
| m | 774995 | 5.9% |
| B | 718135 | 5.4% |
| i | 718135 | 5.4% |
| n | 700118 | 5.3% |
| a | 679292 | 5.1% |
| S | 661275 | 5.0% |
| Other values (11) | 3471943 |
| Value | Count | Frequency (%) |
| / | 718135 | |
| - | 661275 | |
| 622432 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15218086 |
Most frequent character per block
| Value | Count | Frequency (%) |
| r | 1967517 | 12.9% |
| o | 1397427 | 9.2% |
| t | 1327068 | 8.7% |
| e | 800339 | 5.3% |
| m | 774995 | 5.1% |
| / | 718135 | 4.7% |
| B | 718135 | 4.7% |
| i | 718135 | 4.7% |
| n | 700118 | 4.6% |
| a | 679292 | 4.5% |
| Other values (14) | 5416925 |
roof_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.7 MiB |
| Bamboo/Timber-Light roof | |
|---|---|
| Bamboo/Timber-Heavy roof | |
| RCC/RB/RBC | 44141 |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.18091858 |
| Min length | 10 |
Characters and Unicode
| Total characters | 17489354 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RCC/RB/RBC |
|---|---|
| 2nd row | Bamboo/Timber-Light roof |
| 3rd row | Bamboo/Timber-Heavy roof |
| 4th row | Bamboo/Timber-Heavy roof |
| 5th row | Bamboo/Timber-Light roof |
| Value | Count | Frequency (%) |
| Bamboo/Timber-Light roof | 498705 | |
| Bamboo/Timber-Heavy roof | 211626 | |
| RCC/RB/RBC | 44141 | 5.9% |
| Value | Count | Frequency (%) |
| roof | 710331 | |
| bamboo/timber-light | 498705 | |
| bamboo/timber-heavy | 211626 | 14.4% |
| rcc/rb/rbc | 44141 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2841324 | |
| m | 1420662 | 8.1% |
| b | 1420662 | 8.1% |
| r | 1420662 | 8.1% |
| i | 1209036 | 6.9% |
| a | 921957 | 5.3% |
| e | 921957 | 5.3% |
| / | 798613 | 4.6% |
| B | 798613 | 4.6% |
| T | 710331 | 4.1% |
| Other values (12) | 5025537 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12785958 | |
| Uppercase Letter | 2484121 | 14.2% |
| Other Punctuation | 798613 | 4.6% |
| Dash Punctuation | 710331 | 4.1% |
| Space Separator | 710331 | 4.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 2841324 | |
| m | 1420662 | |
| b | 1420662 | |
| r | 1420662 | |
| i | 1209036 | |
| a | 921957 | 7.2% |
| e | 921957 | 7.2% |
| f | 710331 | 5.6% |
| g | 498705 | 3.9% |
| h | 498705 | 3.9% |
| Other values (3) | 921957 | 7.2% |
| Value | Count | Frequency (%) |
| B | 798613 | |
| T | 710331 | |
| L | 498705 | |
| H | 211626 | 8.5% |
| R | 132423 | 5.3% |
| C | 132423 | 5.3% |
| Value | Count | Frequency (%) |
| / | 798613 |
| Value | Count | Frequency (%) |
| - | 710331 |
| Value | Count | Frequency (%) |
| 710331 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15270079 | |
| Common | 2219275 | 12.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 2841324 | |
| m | 1420662 | |
| b | 1420662 | |
| r | 1420662 | |
| i | 1209036 | |
| a | 921957 | 6.0% |
| e | 921957 | 6.0% |
| B | 798613 | 5.2% |
| T | 710331 | 4.7% |
| f | 710331 | 4.7% |
| Other values (9) | 2894544 |
| Value | Count | Frequency (%) |
| / | 798613 | |
| - | 710331 | |
| 710331 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17489354 |
Most frequent character per block
| Value | Count | Frequency (%) |
| o | 2841324 | |
| m | 1420662 | 8.1% |
| b | 1420662 | 8.1% |
| r | 1420662 | 8.1% |
| i | 1209036 | 6.9% |
| a | 921957 | 5.3% |
| e | 921957 | 5.3% |
| / | 798613 | 4.6% |
| B | 798613 | 4.6% |
| T | 710331 | 4.1% |
| Other values (12) | 5025537 |
ground_floor_type
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 43.6 MiB |
| Mud | |
|---|---|
| RC | |
| Brick/Stone | |
| Timber | 3546 |
| Other | 1047 |
Length
| Max length | 11 |
|---|---|
| Median length | 3 |
| Mean length | 3.61503409 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2727442 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RC |
|---|---|
| 2nd row | Mud |
| 3rd row | Mud |
| 4th row | Mud |
| 5th row | Mud |
| Value | Count | Frequency (%) |
| Mud | 611997 | |
| RC | 72418 | 9.6% |
| Brick/Stone | 65464 | 8.7% |
| Timber | 3546 | 0.5% |
| Other | 1047 | 0.1% |
| Value | Count | Frequency (%) |
| mud | 611997 | |
| rc | 72418 | 9.6% |
| brick/stone | 65464 | 8.7% |
| timber | 3546 | 0.5% |
| other | 1047 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 611997 | |
| u | 611997 | |
| d | 611997 | |
| R | 72418 | 2.7% |
| C | 72418 | 2.7% |
| r | 70057 | 2.6% |
| e | 70057 | 2.6% |
| i | 69010 | 2.5% |
| t | 66511 | 2.4% |
| B | 65464 | 2.4% |
| Other values (11) | 405516 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1769624 | |
| Uppercase Letter | 892354 | |
| Other Punctuation | 65464 | 2.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| u | 611997 | |
| d | 611997 | |
| r | 70057 | 4.0% |
| e | 70057 | 4.0% |
| i | 69010 | 3.9% |
| t | 66511 | 3.8% |
| c | 65464 | 3.7% |
| k | 65464 | 3.7% |
| o | 65464 | 3.7% |
| n | 65464 | 3.7% |
| Other values (3) | 8139 | 0.5% |
| Value | Count | Frequency (%) |
| M | 611997 | |
| R | 72418 | 8.1% |
| C | 72418 | 8.1% |
| B | 65464 | 7.3% |
| S | 65464 | 7.3% |
| T | 3546 | 0.4% |
| O | 1047 | 0.1% |
| Value | Count | Frequency (%) |
| / | 65464 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2661978 | |
| Common | 65464 | 2.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| M | 611997 | |
| u | 611997 | |
| d | 611997 | |
| R | 72418 | 2.7% |
| C | 72418 | 2.7% |
| r | 70057 | 2.6% |
| e | 70057 | 2.6% |
| i | 69010 | 2.6% |
| t | 66511 | 2.5% |
| B | 65464 | 2.5% |
| Other values (10) | 340052 |
| Value | Count | Frequency (%) |
| / | 65464 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2727442 |
Most frequent character per block
| Value | Count | Frequency (%) |
| M | 611997 | |
| u | 611997 | |
| d | 611997 | |
| R | 72418 | 2.7% |
| C | 72418 | 2.7% |
| r | 70057 | 2.6% |
| e | 70057 | 2.6% |
| i | 69010 | 2.5% |
| t | 66511 | 2.4% |
| B | 65464 | 2.4% |
| Other values (11) | 405516 |
other_floor_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 52.2 MiB |
| TImber/Bamboo-Mud | |
|---|---|
| Timber-Planck | |
| Not applicable | |
| RCC/RB/RBC | 32407 |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 15.58275986 |
| Min length | 10 |
Characters and Unicode
| Total characters | 11756756 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not applicable |
|---|---|
| 2nd row | TImber/Bamboo-Mud |
| 3rd row | TImber/Bamboo-Mud |
| 4th row | TImber/Bamboo-Mud |
| 5th row | TImber/Bamboo-Mud |
| Value | Count | Frequency (%) |
| TImber/Bamboo-Mud | 482049 | |
| Timber-Planck | 122371 | 16.2% |
| Not applicable | 117645 | 15.6% |
| RCC/RB/RBC | 32407 | 4.3% |
| Value | Count | Frequency (%) |
| timber/bamboo-mud | 482049 | |
| timber-planck | 122371 | 14.0% |
| not | 117645 | 13.5% |
| applicable | 117645 | 13.5% |
| rcc/rb/rbc | 32407 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| b | 1204114 | 10.2% |
| m | 1086469 | 9.2% |
| o | 1081743 | 9.2% |
| a | 839710 | 7.1% |
| e | 722065 | 6.1% |
| T | 604420 | 5.1% |
| r | 604420 | 5.1% |
| - | 604420 | 5.1% |
| / | 546863 | 4.7% |
| B | 546863 | 4.7% |
| Other values (16) | 3915669 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7937989 | |
| Uppercase Letter | 2549839 | 21.7% |
| Dash Punctuation | 604420 | 5.1% |
| Other Punctuation | 546863 | 4.7% |
| Space Separator | 117645 | 1.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| b | 1204114 | |
| m | 1086469 | |
| o | 1081743 | |
| a | 839710 | |
| e | 722065 | |
| r | 604420 | |
| u | 482049 | |
| d | 482049 | |
| l | 357661 | 4.5% |
| i | 240016 | 3.0% |
| Other values (5) | 837693 |
| Value | Count | Frequency (%) |
| T | 604420 | |
| B | 546863 | |
| I | 482049 | |
| M | 482049 | |
| P | 122371 | 4.8% |
| N | 117645 | 4.6% |
| R | 97221 | 3.8% |
| C | 97221 | 3.8% |
| Value | Count | Frequency (%) |
| 117645 |
| Value | Count | Frequency (%) |
| / | 546863 |
| Value | Count | Frequency (%) |
| - | 604420 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10487828 | |
| Common | 1268928 | 10.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| b | 1204114 | |
| m | 1086469 | 10.4% |
| o | 1081743 | 10.3% |
| a | 839710 | 8.0% |
| e | 722065 | 6.9% |
| T | 604420 | 5.8% |
| r | 604420 | 5.8% |
| B | 546863 | 5.2% |
| I | 482049 | 4.6% |
| M | 482049 | 4.6% |
| Other values (13) | 2833926 |
| Value | Count | Frequency (%) |
| - | 604420 | |
| / | 546863 | |
| 117645 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11756756 |
Most frequent character per block
| Value | Count | Frequency (%) |
| b | 1204114 | 10.2% |
| m | 1086469 | 9.2% |
| o | 1081743 | 9.2% |
| a | 839710 | 7.1% |
| e | 722065 | 6.1% |
| T | 604420 | 5.1% |
| r | 604420 | 5.1% |
| - | 604420 | 5.1% |
| / | 546863 | 4.7% |
| B | 546863 | 4.7% |
| Other values (16) | 3915669 |
position
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.1 MiB |
| Not attached | |
|---|---|
| Attached-1 side | |
| Attached-2 side | 26649 |
| Attached-3 side | 1293 |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 12.62052402 |
| Min length | 12 |
Characters and Unicode
| Total characters | 9521832 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not attached |
|---|---|
| 2nd row | Attached-1 side |
| 3rd row | Not attached |
| 4th row | Not attached |
| 5th row | Not attached |
| Value | Count | Frequency (%) |
| Not attached | 598416 | |
| Attached-1 side | 128114 | 17.0% |
| Attached-2 side | 26649 | 3.5% |
| Attached-3 side | 1293 | 0.2% |
| Value | Count | Frequency (%) |
| attached | 598416 | |
| not | 598416 | |
| side | 156056 | 10.3% |
| attached-1 | 128114 | 8.5% |
| attached-2 | 26649 | 1.8% |
| attached-3 | 1293 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2107360 | |
| a | 1352888 | |
| e | 910528 | |
| d | 910528 | |
| 754472 | 7.9% | |
| c | 754472 | 7.9% |
| h | 754472 | 7.9% |
| N | 598416 | 6.3% |
| o | 598416 | 6.3% |
| A | 156056 | 1.6% |
| Other values (6) | 624224 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7700776 | |
| Uppercase Letter | 754472 | 7.9% |
| Space Separator | 754472 | 7.9% |
| Dash Punctuation | 156056 | 1.6% |
| Decimal Number | 156056 | 1.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| t | 2107360 | |
| a | 1352888 | |
| e | 910528 | |
| d | 910528 | |
| c | 754472 | 9.8% |
| h | 754472 | 9.8% |
| o | 598416 | 7.8% |
| s | 156056 | 2.0% |
| i | 156056 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 128114 | |
| 2 | 26649 | 17.1% |
| 3 | 1293 | 0.8% |
| Value | Count | Frequency (%) |
| N | 598416 | |
| A | 156056 | 20.7% |
| Value | Count | Frequency (%) |
| 754472 |
| Value | Count | Frequency (%) |
| - | 156056 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8455248 | |
| Common | 1066584 | 11.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 2107360 | |
| a | 1352888 | |
| e | 910528 | |
| d | 910528 | |
| c | 754472 | 8.9% |
| h | 754472 | 8.9% |
| N | 598416 | 7.1% |
| o | 598416 | 7.1% |
| A | 156056 | 1.8% |
| s | 156056 | 1.8% |
| Value | Count | Frequency (%) |
| 754472 | ||
| - | 156056 | 14.6% |
| 1 | 128114 | 12.0% |
| 2 | 26649 | 2.5% |
| 3 | 1293 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9521832 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 2107360 | |
| a | 1352888 | |
| e | 910528 | |
| d | 910528 | |
| 754472 | 7.9% | |
| c | 754472 | 7.9% |
| h | 754472 | 7.9% |
| N | 598416 | 6.3% |
| o | 598416 | 6.3% |
| A | 156056 | 1.6% |
| Other values (6) | 624224 | 6.6% |
plan_configuration
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 48.8 MiB |
| Rectangular | |
|---|---|
| Square | 17411 |
| L-shape | 9979 |
| T-shape | 961 |
| Multi-projected | 930 |
| Other values (5) | 1275 |
Length
| Max length | 31 |
|---|---|
| Median length | 11 |
| Mean length | 10.82722222 |
| Min length | 6 |
Characters and Unicode
| Total characters | 8168836 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rectangular |
|---|---|
| 2nd row | Rectangular |
| 3rd row | Rectangular |
| 4th row | Rectangular |
| 5th row | Rectangular |
| Value | Count | Frequency (%) |
| Rectangular | 723916 | |
| Square | 17411 | 2.3% |
| L-shape | 9979 | 1.3% |
| T-shape | 961 | 0.1% |
| Multi-projected | 930 | 0.1% |
| Others | 513 | 0.1% |
| U-shape | 447 | 0.1% |
| E-shape | 138 | < 0.1% |
| Building with Central Courtyard | 98 | < 0.1% |
| H-shape | 79 | < 0.1% |
| Value | Count | Frequency (%) |
| rectangular | 723916 | |
| square | 17411 | 2.3% |
| l-shape | 9979 | 1.3% |
| t-shape | 961 | 0.1% |
| multi-projected | 930 | 0.1% |
| others | 513 | 0.1% |
| u-shape | 447 | 0.1% |
| e-shape | 138 | < 0.1% |
| building | 98 | < 0.1% |
| central | 98 | < 0.1% |
| Other values (3) | 275 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1477043 | |
| e | 755402 | |
| r | 743064 | |
| u | 742453 | |
| t | 726583 | |
| l | 725042 | |
| c | 724846 | |
| n | 724112 | |
| g | 724014 | |
| R | 723916 | |
| Other values (22) | 102361 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7401340 | |
| Uppercase Letter | 754668 | 9.2% |
| Dash Punctuation | 12534 | 0.2% |
| Space Separator | 294 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| a | 1477043 | |
| e | 755402 | |
| r | 743064 | |
| u | 742453 | |
| t | 726583 | |
| l | 725042 | |
| c | 724846 | |
| n | 724112 | |
| g | 724014 | |
| q | 17411 | 0.2% |
| Other values (9) | 41370 | 0.6% |
| Value | Count | Frequency (%) |
| R | 723916 | |
| S | 17411 | 2.3% |
| L | 9979 | 1.3% |
| T | 961 | 0.1% |
| M | 930 | 0.1% |
| O | 513 | 0.1% |
| U | 447 | 0.1% |
| C | 196 | < 0.1% |
| E | 138 | < 0.1% |
| B | 98 | < 0.1% |
| Value | Count | Frequency (%) |
| - | 12534 |
| Value | Count | Frequency (%) |
| 294 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8156008 | |
| Common | 12828 | 0.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| a | 1477043 | |
| e | 755402 | |
| r | 743064 | |
| u | 742453 | |
| t | 726583 | |
| l | 725042 | |
| c | 724846 | |
| n | 724112 | |
| g | 724014 | |
| R | 723916 | |
| Other values (20) | 89533 | 1.1% |
| Value | Count | Frequency (%) |
| - | 12534 | |
| 294 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8168836 |
Most frequent character per block
| Value | Count | Frequency (%) |
| a | 1477043 | |
| e | 755402 | |
| r | 743064 | |
| u | 742453 | |
| t | 726583 | |
| l | 725042 | |
| c | 724846 | |
| n | 724112 | |
| g | 724014 | |
| R | 723916 | |
| Other values (22) | 102361 | 1.3% |
has_superstructure_adobe_mud
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 31979 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 722493 | |
| 1 | 31979 | 4.2% |
| Value | Count | Frequency (%) |
| 0 | 722493 | |
| 1 | 31979 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 722493 | |
| 1 | 31979 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 722493 | |
| 1 | 31979 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 722493 | |
| 1 | 31979 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 722493 | |
| 1 | 31979 | 4.2% |
has_superstructure_mud_mortar_stone
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 603794 | |
| 0 | 150678 | 20.0% |
| Value | Count | Frequency (%) |
| 1 | 603794 | |
| 0 | 150678 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 603794 | |
| 0 | 150678 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 1 | 603794 | |
| 0 | 150678 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 1 | 603794 | |
| 0 | 150678 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 603794 | |
| 0 | 150678 | 20.0% |
has_superstructure_stone_flag
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 26506 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 727966 | |
| 1 | 26506 | 3.5% |
| Value | Count | Frequency (%) |
| 0 | 727966 | |
| 1 | 26506 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 727966 | |
| 1 | 26506 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 727966 | |
| 1 | 26506 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 727966 | |
| 1 | 26506 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 727966 | |
| 1 | 26506 | 3.5% |
has_superstructure_cement_mortar_stone
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 11947 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 742525 | |
| 1 | 11947 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 742525 | |
| 1 | 11947 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 742525 | |
| 1 | 11947 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 742525 | |
| 1 | 11947 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 742525 | |
| 1 | 11947 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 742525 | |
| 1 | 11947 | 1.6% |
has_superstructure_mud_mortar_brick
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 17334 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 737138 | |
| 1 | 17334 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 737138 | |
| 1 | 17334 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 737138 | |
| 1 | 17334 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 737138 | |
| 1 | 17334 | 2.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 737138 | |
| 1 | 17334 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 737138 | |
| 1 | 17334 | 2.3% |
has_superstructure_cement_mortar_brick
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 53933 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 700539 | |
| 1 | 53933 | 7.1% |
| Value | Count | Frequency (%) |
| 0 | 700539 | |
| 1 | 53933 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 700539 | |
| 1 | 53933 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 700539 | |
| 1 | 53933 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 700539 | |
| 1 | 53933 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 700539 | |
| 1 | 53933 | 7.1% |
has_superstructure_timber
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 559252 | |
| 1 | 195220 | 25.9% |
| Value | Count | Frequency (%) |
| 0 | 559252 | |
| 1 | 195220 | 25.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 559252 | |
| 1 | 195220 | 25.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 559252 | |
| 1 | 195220 | 25.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 559252 | |
| 1 | 195220 | 25.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 559252 | |
| 1 | 195220 | 25.9% |
has_superstructure_bamboo
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 60696 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 693776 | |
| 1 | 60696 | 8.0% |
| Value | Count | Frequency (%) |
| 0 | 693776 | |
| 1 | 60696 | 8.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 693776 | |
| 1 | 60696 | 8.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 693776 | |
| 1 | 60696 | 8.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 693776 | |
| 1 | 60696 | 8.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 693776 | |
| 1 | 60696 | 8.0% |
has_superstructure_rc_non_engineered
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 30017 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 724455 | |
| 1 | 30017 | 4.0% |
| Value | Count | Frequency (%) |
| 0 | 724455 | |
| 1 | 30017 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 724455 | |
| 1 | 30017 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 724455 | |
| 1 | 30017 | 4.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 724455 | |
| 1 | 30017 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 724455 | |
| 1 | 30017 | 4.0% |
has_superstructure_rc_engineered
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 12370 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 742102 | |
| 1 | 12370 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 742102 | |
| 1 | 12370 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 742102 | |
| 1 | 12370 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 742102 | |
| 1 | 12370 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 742102 | |
| 1 | 12370 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 742102 | |
| 1 | 12370 | 1.6% |
has_superstructure_other
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.7 MiB |
| 0 | |
|---|---|
| 1 | 9083 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 754472 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 745389 | |
| 1 | 9083 | 1.2% |
| Value | Count | Frequency (%) |
| 0 | 745389 | |
| 1 | 9083 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 745389 | |
| 1 | 9083 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 754472 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 745389 | |
| 1 | 9083 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 754472 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 745389 | |
| 1 | 9083 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 754472 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 745389 | |
| 1 | 9083 | 1.2% |
technical_solution_proposed
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 50.3 MiB |
| Reconstruction | |
|---|---|
| Major repair | |
| Minor repair | |
| No need |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 12.89381183 |
| Min length | 7 |
Characters and Unicode
| Total characters | 9728020 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No need |
|---|---|
| 2nd row | Reconstruction |
| 3rd row | Reconstruction |
| 4th row | Reconstruction |
| 5th row | No need |
| Value | Count | Frequency (%) |
| Reconstruction | 465543 | |
| Major repair | 128086 | 17.0% |
| Minor repair | 109497 | 14.5% |
| No need | 51346 | 6.8% |
| Value | Count | Frequency (%) |
| reconstruction | 465543 | |
| repair | 237583 | |
| major | 128086 | 12.3% |
| minor | 109497 | 10.5% |
| need | 51346 | 4.9% |
| no | 51346 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1220015 | |
| r | 1178292 | |
| n | 1091929 | |
| c | 931086 | |
| t | 931086 | |
| i | 812623 | |
| e | 805818 | |
| R | 465543 | 4.8% |
| s | 465543 | 4.8% |
| u | 465543 | 4.8% |
| Other values (7) | 1360542 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8684619 | |
| Uppercase Letter | 754472 | 7.8% |
| Space Separator | 288929 | 3.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 1220015 | |
| r | 1178292 | |
| n | 1091929 | |
| c | 931086 | |
| t | 931086 | |
| i | 812623 | |
| e | 805818 | |
| s | 465543 | 5.4% |
| u | 465543 | 5.4% |
| a | 365669 | 4.2% |
| Other values (3) | 417015 | 4.8% |
| Value | Count | Frequency (%) |
| R | 465543 | |
| M | 237583 | |
| N | 51346 | 6.8% |
| Value | Count | Frequency (%) |
| 288929 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9439091 | |
| Common | 288929 | 3.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 1220015 | |
| r | 1178292 | |
| n | 1091929 | |
| c | 931086 | |
| t | 931086 | |
| i | 812623 | |
| e | 805818 | |
| R | 465543 | 4.9% |
| s | 465543 | 4.9% |
| u | 465543 | 4.9% |
| Other values (6) | 1071613 |
| Value | Count | Frequency (%) |
| 288929 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9728020 |
Most frequent character per block
| Value | Count | Frequency (%) |
| o | 1220015 | |
| r | 1178292 | |
| n | 1091929 | |
| c | 931086 | |
| t | 931086 | |
| i | 812623 | |
| e | 805818 | |
| R | 465543 | 4.8% |
| s | 465543 | 4.8% |
| u | 465543 | 4.8% |
| Other values (7) | 1360542 |
damage_grade
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 46.0 MiB |
| Grade 5 | |
|---|---|
| Grade 4 | |
| Grade 3 | |
| Grade 2 | |
| Grade 1 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 5281304 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Grade 1 |
|---|---|
| 2nd row | Grade 5 |
| 3rd row | Grade 5 |
| 4th row | Grade 5 |
| 5th row | Grade 1 |
| Value | Count | Frequency (%) |
| Grade 5 | 273008 | |
| Grade 4 | 182006 | |
| Grade 3 | 135048 | |
| Grade 2 | 86384 | 11.4% |
| Grade 1 | 78026 | 10.3% |
| Value | Count | Frequency (%) |
| grade | 754472 | |
| 5 | 273008 | 18.1% |
| 4 | 182006 | 12.1% |
| 3 | 135048 | 8.9% |
| 2 | 86384 | 5.7% |
| 1 | 78026 | 5.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 754472 | |
| r | 754472 | |
| a | 754472 | |
| d | 754472 | |
| e | 754472 | |
| 754472 | ||
| 5 | 273008 | 5.2% |
| 4 | 182006 | 3.4% |
| 3 | 135048 | 2.6% |
| 2 | 86384 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3017888 | |
| Uppercase Letter | 754472 | 14.3% |
| Space Separator | 754472 | 14.3% |
| Decimal Number | 754472 | 14.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 5 | 273008 | |
| 4 | 182006 | |
| 3 | 135048 | |
| 2 | 86384 | 11.4% |
| 1 | 78026 | 10.3% |
| Value | Count | Frequency (%) |
| r | 754472 | |
| a | 754472 | |
| d | 754472 | |
| e | 754472 |
| Value | Count | Frequency (%) |
| G | 754472 |
| Value | Count | Frequency (%) |
| 754472 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3772360 | |
| Common | 1508944 | 28.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 754472 | ||
| 5 | 273008 | 18.1% |
| 4 | 182006 | 12.1% |
| 3 | 135048 | 8.9% |
| 2 | 86384 | 5.7% |
| 1 | 78026 | 5.2% |
| Value | Count | Frequency (%) |
| G | 754472 | |
| r | 754472 | |
| a | 754472 | |
| d | 754472 | |
| e | 754472 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5281304 |
Most frequent character per block
| Value | Count | Frequency (%) |
| G | 754472 | |
| r | 754472 | |
| a | 754472 | |
| d | 754472 | |
| e | 754472 | |
| 754472 | ||
| 5 | 273008 | 5.2% |
| 4 | 182006 | 3.4% |
| 3 | 135048 | 2.6% |
| 2 | 86384 | 1.6% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | building_id | district_id | vdcmun_id | ward_id | legal_ownership_status | count_families | has_secondary_use | has_secondary_use_agriculture | has_secondary_use_hotel | has_secondary_use_rental | has_secondary_use_institution | has_secondary_use_school | has_secondary_use_industry | has_secondary_use_health_post | has_secondary_use_gov_office | has_secondary_use_use_police | has_secondary_use_other | count_floors_pre_eq | age_building | plinth_area_sq_ft | height_ft_pre_eq | land_surface_condition | foundation_type | roof_type | ground_floor_type | other_floor_type | position | plan_configuration | has_superstructure_adobe_mud | has_superstructure_mud_mortar_stone | has_superstructure_stone_flag | has_superstructure_cement_mortar_stone | has_superstructure_mud_mortar_brick | has_superstructure_cement_mortar_brick | has_superstructure_timber | has_superstructure_bamboo | has_superstructure_rc_non_engineered | has_superstructure_rc_engineered | has_superstructure_other | technical_solution_proposed | damage_grade | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 632841 | 312004091341 | 31 | 3104 | 310404 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 7 | 720 | 10 | Flat | RC | RCC/RB/RBC | RC | Not applicable | Not attached | Rectangular | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | No need | Grade 1 |
| 1 | 319471 | 240402000391 | 24 | 2408 | 240802 | Private | 1.0 | 1.0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 20 | 315 | 21 | Moderate slope | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Attached-1 side | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 2 | 490564 | 286202000091 | 28 | 2812 | 281201 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 38 | 382 | 13 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Heavy roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 3 | 215031 | 224202001381 | 22 | 2201 | 220106 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 80 | 200 | 12 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Heavy roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 4 | 156516 | 214508000132 | 21 | 2106 | 210608 | Private | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 465 | 13 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | No need | Grade 1 |
| 5 | 187577 | 221701001641 | 22 | 2201 | 220101 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 10 | 651 | 21 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Heavy roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | Major repair | Grade 3 |
| 6 | 58192 | 202005000561 | 20 | 2009 | 200904 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 302 | 12 | Moderate slope | Bamboo/Timber | Bamboo/Timber-Heavy roof | Mud | Timber-Planck | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | No need | Grade 3 |
| 7 | 503026 | 291802020431 | 29 | 2904 | 290402 | Private | 1.0 | 1.0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 60 | 398 | 15 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | Timber-Planck | Attached-2 side | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 8 | 552861 | 302804001931 | 30 | 3001 | 300102 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 22 | 263 | 18 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 9 | 382732 | 246404000241 | 24 | 2408 | 240803 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 20 | 155 | 14 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 2 |
Last rows
| df_index | building_id | district_id | vdcmun_id | ward_id | legal_ownership_status | count_families | has_secondary_use | has_secondary_use_agriculture | has_secondary_use_hotel | has_secondary_use_rental | has_secondary_use_institution | has_secondary_use_school | has_secondary_use_industry | has_secondary_use_health_post | has_secondary_use_gov_office | has_secondary_use_use_police | has_secondary_use_other | count_floors_pre_eq | age_building | plinth_area_sq_ft | height_ft_pre_eq | land_surface_condition | foundation_type | roof_type | ground_floor_type | other_floor_type | position | plan_configuration | has_superstructure_adobe_mud | has_superstructure_mud_mortar_stone | has_superstructure_stone_flag | has_superstructure_cement_mortar_stone | has_superstructure_mud_mortar_brick | has_superstructure_cement_mortar_brick | has_superstructure_timber | has_superstructure_bamboo | has_superstructure_rc_non_engineered | has_superstructure_rc_engineered | has_superstructure_other | technical_solution_proposed | damage_grade | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 754462 | 582560 | 304208000151 | 30 | 3009 | 300913 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 28 | 247 | 16 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Heavy roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 4 |
| 754463 | 398019 | 247109001151 | 24 | 2401 | 240105 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 18 | 567 | 15 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Attached-1 side | Rectangular | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 754464 | 193783 | 222304000871 | 22 | 2204 | 220404 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 8 | 347 | 18 | Moderate slope | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | Timber-Planck | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 754465 | 97897 | 204403000801 | 20 | 2007 | 200706 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 14 | 486 | 21 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | Reconstruction | Grade 4 |
| 754466 | 397094 | 247101001681 | 24 | 2405 | 240501 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 7 | 495 | 16 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 754467 | 191690 | 222105000421 | 22 | 2209 | 220903 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 8 | 230 | 21 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Heavy roof | Mud | TImber/Bamboo-Mud | Attached-1 side | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 4 |
| 754468 | 615995 | 311409000571 | 31 | 3102 | 310202 | Private | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 450 | 12 | Flat | Bamboo/Timber | Bamboo/Timber-Light roof | Mud | Timber-Planck | Not attached | Rectangular | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | Minor repair | Grade 3 |
| 754469 | 33748 | 124903000611 | 12 | 1208 | 120803 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 30 | 253 | 10 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | Not applicable | Not attached | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 5 |
| 754470 | 418013 | 280407000882 | 28 | 2801 | 280111 | Private | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 35 | 400 | 15 | Moderate slope | Mud mortar-Stone/Brick | Bamboo/Timber-Heavy roof | Mud | Timber-Planck | Attached-1 side | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Reconstruction | Grade 4 |
| 754471 | 607602 | 311004001611 | 31 | 3111 | 311109 | Private | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 80 | 210 | 17 | Flat | Mud mortar-Stone/Brick | Bamboo/Timber-Light roof | Mud | TImber/Bamboo-Mud | Attached-1 side | Rectangular | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Major repair | Grade 3 |